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Encoded self-assembling chemical libraries (ESACHEL) 

20 



Problem to be solved: 

25 The isolation of specific binding molecules (e.g. organic molecules) is a central 
problem in chemistry, biology and pharmaceutical sciences. Typically, millions of 
molecules have to be screened, in order to find a suitable candidate. The prepa- 
ration of very large libraries of organic molecules is typically cumbersome. Fur- 
thermore, the complexity associated with the identification of specific binding 

30 molecules from a pool of candidates grows with the size of the chemical library to 
be screened. 
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Solution: 

In this invention, we use self-assembling libraries of organic molecules (typically 
forming dinners, trimers or tetramers), in which the organic molecules are linked 
to an oligonucleotide which mediates the self-assembly of the library and/or pro- 
5 vides a code associated to each binding moiety. The resulting library can be very 
large (as it originates by the combinatorial self-assembly of smaller sub-libra- 
ries). After the capture of the desired binding specificities on the target of inter- 
est, the "binding code" can be "decoded" by a number of experimental tech- 
niques (e.g., hybridization on DNA chips or by a modified polymerase chain reac- 
10 tion (PCR) technique followed by sequencing). 



INTRODUCTION 

15 The isolation of specific binding molecules (e.g., organic molecules) is a central 
problem in chemistry, biology and pharmaceutical sciences. For example, the 
vast majority of the drugs approved by the U.S. Food and Drug Administration 
are specific binders of biological targets which fall into one of the following cate- 
gories: enzymes, receptors or ion channels. The specific binding to the biological 

20 target is not per se sufficient to turn a binding molecule into a drug, as it is 

widely recognized that other molecular properties (such as pharmacokinetic be- 
haviour and stability) contribute to the performance of a drug. However, the iso- 
lation of specific binders against a relevant biological target typically represents 
the starting point in the process which leads to a new drug [Drews J. Drug dis- 

25 covery: a historical perspective. Science (2000) 287:1960-1964]. 

The ability to rapidly generate specific binders against the biological targets of 
interest would be invaluable also for a variety of chemical and biological applica- 
tions. For example, the specific neutralization of a particular epitope of the intra- 
30 cellular protein of choice may provide information on the functional role of this 
epitope (and consequently of this protein). In principle, the use of monoclonal 
antibodies specific for a given epitope may provide the same type of information 
[Winter G, Griffiths AD, Hawkins RE, Hoogenboom HR. Making antibodies by 
phage display technology. Annu Rev Immunol. (1994) 12:433-455]. However, 
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most antibodies do not readily cross the cell membrane and have to be artificially 
introduced into the cell of interest. In principle, intracellular antibodies can also 
be expressed into target cells by targeted gene delivery (e.g., by cell transfection 
with DNA directing the expression of the antibody). In this case, the antibody 
5 often does not fold, as the reducing intracellular milieu does not allow the forma- 
tion of disulfide bonds which often contribute in an essential manner to antibody 
stability [Desiderio A, Franconi R, Lopez M, Villani ME, Viti F, Chiaraluce R, Con- 
salvi V, Neri D, Benvenuto E. A semi-synthetic repertoire of intrinsically stable 
antibody fragments derived from a single-framework scaffold. J Mol Biol. (2001) 
10 310: 603-615]. High affinity binding molecules amenable to chemical synthesis 
may provide a valuable alternative to antibody technology. 

In Chemistry and Materials Sciences, the facile isolation of specific binding mole- 
cules may be useful for purposes as diverse as the generation of biosensors, the 
15 acceleration of chemical reactions, the design of materials with novel properties, 
the selective capture/separation/immobilization of target molecules. 

The generation of large repertoires of molecules (e.g., by combinatorial chemis- 
try; Otto S, Furlan RL, Sanders JK. Dynamic combinatorial chemistry. Drug Dis- 

20 cov Today. (2002) 7: 117-125), coupled to ingenious screening methodologies, is 
recognized as an important avenue for the isolation of desired binding specifici- 
ties. For example, most large pharmaceutical companies have proprietary chemi- 
cal libraries, which they search for the identification of lead compounds. These 
libraries may be as large as > 1 million members and yet, in some instances, not 

25 yield the binding specificities of interest [Bohm HJ, Stahl M. Structure-based li- 
brary design: molecular modeling merges with combinatorial chemistry. Current 
Opinion in Chemical Biology (2000) 4: 283-286]. The screening of libraries con- 
taining millions of compounds may require not only very sophisticated synthetic 
methods, but also complex robotics and infrastructure for the storage, screening 

30 and evaluation of the members of the library. 



The generation of large macromolecular repertoires (e.g., peptide or protein li- 
braries), together with efficient biological and/or biochemical methods for the 
identification of binding specificities (such as phage display [Winter, 1994], pep- 
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tides on plasmids [Cull MG, Miller JF, Schatz PJ. Screening for receptor ligands 
using large libraries of peptides linked to the C terminus of the lac repressor. Proc 
Natl Acad Sci USA. (1992) 89: 1865-1869] ribosome display [Schaffitzel C, Ha- 
nes J, Jermutus L, Pluckthun A. Ribosome display: an in vitro method for selec- 
5 tion and evolution of antibodies from libraries. J Immunol Methods. (1999) 231: 
119-135] yeast display [Boder ET, Wittrup KD. Yeast surface display for screen- 
ing combinatorial polypeptide libraries. Nat Biotechnol. (1997) 15: 553-557], 
periplasmic expression with cytometric screening [Chen G, Hayhurst A, Thomas 
JG, Harvey BR, Iverson BL, Georgiou G. Isolation of high-affinity ligand-binding 

10 proteins by periplasmic expression with cytometric screening (PECS). Nat 

Biotechnol. (2001) 19:537-542], iterative colony filter screening [Giovannoni L, 
Viti F, Zardi L, Neri D. Isolation of anti-angiogenesis antibodies from a large 
combinatorial repertoire by colony filter screening. Nucleic Acids Res. (2001) 29: 
E27] etc.) may allow the isolation of valuable polypeptide binders, such as spe- 

15 cific monoclonal antibodies, improved hormones and novel DNA-binding proteins. 
In contrast to conventional chemical libraries, protein libraries in the embodi- 
ments mentioned above may allow the efficient screening of as many as 1-10 
billion individual members, in the search of a binding specificity of interest. On 
one hand, the generation of libraries of genes (e.g., the combinatorial mutagene- 

20 sis of antibody genes; Winter, 1994; Viti F, Nilsson F, Demartis S, Huber A, Neri 
D. Design and use of phage display libraries for the selection of antibodies and 
enzymes. Methods Enzymol. (2000) 326:480-505) can directly be translated into 
libraries of proteins, using suitable expression systems (e.g. bacteria, yeasts, 
mammalian cells). Furthermore, methods such as phage display produce parti- 

25 cles in which a phenotype (typically the binding properties of a protein, displayed 
on the surface of filamentous phage) is physically coupled to the corresponding 
genotype (i.e., the gene coding for the protein displayed on phage) [Winter, 
1994], allowing the facile amplification and identification of library binding mem- 
bers with the desired binding specificity. 

30 

However, while biological/biochemical methods for the isolation of specific binding 
biomacromolecules can provide very useful binding specificities, their scope is 
essentially limited to repertoires of polypeptides or of nucleic acids [Brody EN, 
Gold L. Aptamers as therapeutic and diagnostic agents. J Biotechnol. (2000) 
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74:5-13]. For some applications, large biomacromolecules (such as proteins or 
DNA) are not ideal. For example, they are often unable to efficiently cross the cell 
membrane, and may undergo hydrolytic degradation in vivo . 

5 In an attempt to mimic biological/biochemical methods for the identification of 
organic molecules with desired binding properties, out of a chemical library, 
Brenner and Lerner [Brenner S, Lerner RA. Encoded combinatorial chemistry. 
Proc Natl Acad Sci USA. (1992) 89: 5381-5383] have proposed the use of en- 
coded chemical libraries (ECL). In their invention, the authors conceived a proc- 

10 ess of alternating parallel combinatorial synthesis in order to encode individual 
members of a large library of chemicals with unique nucleotide sequences. In 
particular, the authors postulated the combinatorial synthesis of polymeric 
chemical compounds on a solid support (e.g., a bead), where a step in the com- 
binatorial synthesis would be followed by the synthesis (on the same bead) of a 

15 DNA sequence, to be used as a "memory tag" for the chemical reactions per- 
formed on the bead. In typical applications, DNA-encoded beads would be incu- 
bated with a target molecule (e.g., a protein of pharmaceutical relevance). After 
the DNA-tagged bead bearing the polymeric chemical entity is bound to the tar- 
get, it should be possible to amplify the genetic tag by replication and use it for 

20 enrichment of the bound molecules by serial hybridization to a subset of the li- 
brary. The nature of the polymeric chemical structure bound to the receptor 
could be decoded by sequencing the nucleotide tag. 

The ECL method has the advantage of introducing the concept of "coding" a par- 
25 ticular polymeric chemical moiety, synthesized on a bead, with a corresponding 
oligonucleotidic sequence, which can be "read" and amplified by PCR. However, 
the ECL method has a number of drawbacks. First, a general chemistry is needed 
which allows the alternating synthesis of polymeric organic molecules (often with 
different reactivity properties) and DNA synthesis on a bead. Second, the synthe- 
30 sis, management and quality control of large libraries (e.g., > 1 million individual 
members) remains a formidable task. In fact, the usefulness of the ECL method 
has yet to be demonstrated with experimental examples. 
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PRIOR ART 

From US 5,573,905 an encoded combinatorial chemical library is known which 
comprises a plurality of bifunctional molecules according to the formula A-B-C, 
5 where A is a polymeric chemical moiety. B is a linker molecule operatively linking 
A and C, consisting of a chain length of 1 to about 20 atoms and preferably com- 
prising means for attachment to a solid support. C is an identifier oligonucleotide 
comprising a sequence of nucleotides that identifies the structure of the chemical 
moiety. The attachment to a solid support is especially preferred when synthe- 

10 sizing step by step the chemical moiety (a polymer built of subunits Xi- n ) and the 
oligonucleotide (built of nucleotides Zi- n which code for and identify the structure 
of the chemical subunits of the polymer). Also described are the bifunctional 
molecules of the library, and methods of using the library to identify chemical 
structures within the library that bind to biological active molecules in preselected 

15 binding interactions. Utilizing the code C for the identification of the polymer A 
and attaching the code C to the polymer A with a linker molecule B allows the 
polymer to be identified exactly, however, the solution presented in US 
5,573,905 (which basically is the same as published by Brenner and Lerner, 
1992) is limited to this special type of a chemical moiety. The fact that individual 

20 synthesis has to be carried out for each individual of a chemical library is re- 
garded as another disadvantage. 

It is therefore an object of the present invention to provide a chemical compound 
comprising a chemical moiety of any kind capable of performing a binding inter- 
25 action with a target molecule (e.g. a biological target) and further comprising an 
oligonucleotide or functional analogue thereof which chemical compound does not 
need to be individually synthesized in order to build up a chemical library. 

This object is met according to a first aspect by the combination of features of 
30 independent claim 1, defining a chemical compound comprising a chemical moi- 
ety (p) capable of performing a binding interaction with a target molecule (e.g. a 
biological target) and further comprising an oligonucleotide (b) or functional 
analogue thereof which is characterized in that the oligonucleotide (b) or func- 
tional analogue comprises at least one self-assembly sequence (bl) capable of 
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performing a combination reaction with at least one self-assembly sequence (bl') 
of a complementary oligonucleotide or functional analogue bound to another 
chemical compound comprising a chemical moiety (q). 

5 This object is met according to a second aspect by the combination of features of 
independent claim 3, defining a chemical compound comprising a chemical moi- 
ety (p) capable of performing a binding interaction with a target molecule (e.g. a 
biological target) and further comprising an oligonucleotide (b) or functional 
analogue thereof, which comprises a coding sequence (bl) coding for the identi- 
10 fication of the chemical moiety (p), and which is characterized in that the chemi- 
cal compound further comprises at least one self-assembly moiety (m) capable of 
performing a combination reaction with at least one self-assembly moiety (m 1 ) of 
a another chemical compound comprising a chemical moiety (q). 

15 Further aspects and features of the present invention derive from the dependent 
claims. 

SUMMARY OF THE INVENTION 

20 

In our invention, we reasoned that a key contribution to the generation (and 
screening) of very large chemical libraries may come from the "self-assembly" of 
encoded molecules. In particular, we reasoned that self-assembly (e.g., by ho- 
modimerization, heterodimerization or multimerization) of DNA-tagged chemical 

25 entities would represent an avenue for the facile generation of very large DNA- 
tagged chemical libraries, starting from smaller DNA-tagged chemical libraries. 
For example, self-assembly (heterodimerization) of two libraries containing 1000 
members would yield l'000'OOO different combinations, i.e. l'000'OOO different 
chemical entities. Notably, homo- or hetero-trimerization of encoded libraries 

30 containing 1000 DNA-tagged members would yield a library containing 

l'000'OOO'OOO different DNA-tagged combinations, i.e. chemical entities. Thus, 
the present invention provides a chemical compound comprising a chemical moi- 
ety of any kind capable of performing a binding interaction with a target molecule 
(e.g. a biological target) and further comprising an oligonucleotide or functional 



WO 03/076943 



- 8 - 



PCT/EP02/04153 



analogue thereof which can be synthesized separately and then coupled to- 
gether. The resulting chemical derivative(s) of the oligonucleotide can further 
assemble with other similar compounds to generate higher order structures and 
encoded libraries of compounds. 

5 

For illustrative purposes, one particular embodiment of our invention is depicted 
in Figure 1. Two chemical libraries are synthesized by chemical modification of 
the 3' end and the 5' end, respectively, of oligonucleotides capable of duplex 
formation and which carry distinctive "sequence tags" (associated with [and 

10 therefore "coding for"] the chemical moiety attached to their extremity). The re- 
sulting encoded self-assembled chemical library (ESACHEL) can be very large (as 
it originates from the combinatorial self-assembly of two smaller libraries) and 
can be screened for binding to a biological target (e.g., a protein of pharmaceuti- 
cal interest). Those members of the library which display suitable binding specifi- 

15 cities can be captured with the target of interest (for example, using a target 

immobilized on a solid support). Their genetic code, encoding the chemical entity 
responsible for the binding specificity of interest, can then be retrieved using a 
number of ingenious methods, which are described in the section "Description of 
the Invention" (see below). 

20 

DEFINITIONS 

Specific: 

25 This may be used to refer to the situation in which one member of a specific 

binding pair will not show any significant binding to molecules other than its spe- 
cific binding partner(s). In general, specificity is associated with a significant dif- 
ference in binding affinity, relative to "non-specific" targets. The term is also ap- 
plicable where e.g. a binding member is specific for a particular surface on the 

30 target molecule (hereafter termed as " epitope"), in which case the specific 

binding member with this specificity will be able to bind to various target mole- 
cules carrying the epitope. 
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FIGURE CAPTIONS 

Figure 1: A simple embodiment of ESACHEL technology: 

In a simple embodiment of ESACHEL technology, two chemical libraries are syn- 
5 thesized by individual chemical modification of the 3' end and the 5' end, respec- 
tively, of oligonucleotides capable of partial heteroduplex formation and which 
carry distinctive "sequence tags' 7 (associated with [and therefore "coding for"] 
the chemical moieties p and q attached to their extremity). The resulting en- 
coded self-assembled chemical library (ESACHEL) can be very large (as it origi- 

10 nates from the combinatorial self-assembly of two smaller libraries) and can be 
screened for binding to a biological target (e.g., a protein of pharmaceutical in- 
terest). Those members of the library which display suitable binding specificities 
can be captured with the target of interest (for example, using a target immobi- 
lized on a solid support). Their genetic code, encoding the chemical entity re- 

15 sponsible for the binding specificity of interest, can then be retrieved using a 
number of ingenious methods 

Figure 2: Generalization of the ESACHEL design: 

The main ingredients of ESACHEL technology are chemical compounds, com- 
20 prising an oligonucleotidic moiety (typically, a DNA sequence) linked to an 
oligomerization domain [capable of mediating the (homo- or hetero-) dimeri- 
zation, trimerization or tetramerization of the chemical compounds], linked to 
a chemical entity, which may be involved in a specific binding interaction with a 
target molecule. Part of the sequence of the oligonucleotidic moiety will be 
25 uniquely associated with the chemical entity (therefore acting as a "code"). The 
oligomerization domain and the code can be distinct portions of the same mole- 
cule (typically an oligonucleotide). 

Figure 3: Self-assembly of individual ESACHEL chemical compounds can 
30 yield combinatorial libraries of large size: 

In a practical embodiment of ESACHEL technology, a number n of different 
chemical compounds, carrying a thiol-reactive moiety (e.g., a maleimido or a io- 
doacetamido group), are reacted (in separate reactions) with n different DNA 
oligonucleotides, carrying a thiol group at the 3' end. The corresponding pool of n 
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conjugates is indicated in the Figure as "pool A". Similarly, a number m of differ- 
ent chemical compounds, carrying a thiol-reactive moiety (e.g., a maleimido or a 
iodoacetamido group), are reacted (in separate reactions) with m different DNA 
oligonucleotides, carrying a thiol group at the 5' end. The corresponding pool of 
5 m conjugates is indicated in the Figure as "pool B". The resulting self-assembled 
library members will correspond to m x n combinations. 

Figure 4: Large library sizes can be achieved by ESACHEL embodiments, 
in which trimerization domains or tetramerization domains are DNA se- 
10 quences forming triplexes or quadruplexes: 

Certain DNA sequences are known to be capable of forming stable trimeric com- 
plexes or stable tetrameric complexes. For example, Hoogsten pairing of DNA 
triplexes could allow the self-assembly of Pools A x B x C, containing n, m and I 
members, respectively. The tetrameric assembly of DNA-(chemical moiety) con- 
15 jugates would allow even larger library sizes, starting from sub-libraries A, B, C 
and D of small dimension. 

Figure 5: One method of ESACHEL decoding: 

The oligonucleotides of sub-library A bear chemical entities at the 3' end. To- 
20 wards the 3' extremity, the DNA sequence is designed to hybridize to the DNA 
sequences at the 5' extremity of oligonucleotides of sub-library B. The hybridiza- 
tion region is interrupted by a small segment. In sub-library A, this small seg- 
ment is conveniently composed of phosphodiester backbone without bases 
(termed d-spacer in the Figure); in sub-library B, the corresponding short seg- 
25 ment will have unique sequence for each member of the sub-library (therefore 
acting as "code" for the sub-library B). By contrast, oligonucleotides of sub- 
library A have their distinctive code towards the 5' extremity. 

After biopanning, oligonucleotides of sub-library B remain stably annealed to oli- 
30 gonucleotides of sub-library A, and can work as primers for a DNA polymerase 
reaction on the template A. The resulting DNA segment, carrying both code A 
and code B, can be amplified (typically by PCR), using primers which hybridize at 
the constant extremities of the DNA segment. 
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Figure 6: A general method of ESACHEL decoding: 

The identity of specific binders, isolated from sub-libraries A and B carrying 
chemical moieties at the extremities of partially-annealing oligonucleotides, is 
established by hybridization with target oligonucleotides immobilized on one or 
5 more chips. Such chips preferably are made from silicon wafers with attached 
oligonucleotide fragments. For example, chip A will allow the reading of the iden- 
tity (and frequency) of members of sub-library A, rescued after a biopanning ex- 
periment. Similarly, chip B will allow the reading of the identity (and frequency) 
of members of sub-library B. In a first step, the decoding method depicted in the 

10 Figure will not provide information about the pairing of code A and code B within 
specific binding members. However, decoding on chip A and B will suggest candi- 
date components of sub-libraries A and B, to be re-annealed and screened in a 
successive round of bio-panning. Increasingly stringent binding to the target will 
be mirrored by a reduction in the number of A and B members, identified on the 

15 chip. Ultimately, the possible combinations of the candidate A and B members 
will be assembled individually (or in smaller pools), and assayed for binding to 
the target. 

Figure 7: A PCR-based method for ESACHEL deconvolution: 

20 Sub-libraries A and B form a heteroduplex, flanked by unique sequences coding 
for the different library members and by constant DNA segments at the termini. 
After biopanning, suitable pairs of primers allow the PCR amplification of the two 
strands, yielding PCR products whose sequence can be identified using standard 
methods (e.g. by concatenation of the PCR products, followed by subcloning and 

25 sequencing. A deconvolution procedure may be applied (consisting of one or 

more rounds of panning, followed by sequencing and by the choice of a restricted 
set of sub-library components for the next ESACHEL screening), restricting the 
number of candidate ESACHEL members capable of giving specific binders after 
self-assembly. 

30 

Figure 8: Converting ESACHEL ligands into covalently-linked chemical 
moieties: 

In many ESACHEL embodiments, chemical derivatives of self-assembling oligonu- 
cleotides will be isolated at the end of one or more rounds of panning. For many 
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applications, it will be desirable to covalently link together the chemical moieties, 
responsible for the interaction with the target molecule of interest. The length, 
rigidity, stereoelectronic chemical properties and solubility of the linker will influ- 
ence the binding affinity and performance of the resulting molecule. 

5 

Figure 9: Chemical equilibria contributing to the chelate effect: 

The diagram shows the possible states of the interactions between a bidentate 
ligand (A-B) binding to a target molecule. In state nl, both A and B moieties are 
bound to their respective binding pockets. In state nil and nlll only moiety A or 
10 B are bound, respectively. In state nIV, the compound A-B is dissociated from 
the target. 

A computer program has been written for the approximate evaluation of the 
contribution of the chelate effect to the residence time of A-B on the target in 
15 irreversible dissociation conditions, as a function of kinetic association and disso- 
ciation constants of the moieties A and B towards their respective binding pock- 
ets, and of the linker length between A and B. The probability that one moiety 
dissociates per time unit is indicated as poff. The probability that one moiety 
binds to the target per time unit is indicated as pon. 

20 

Figure 10: Assembly of molecule p with molecular repertoire Q:The dia- 
gram shows heteroduplex formation between an oligonucleotide, coupled to a 
low-affinity binder p, and a second class of oligonucleotides, which bear chemical 
moieties q and distinctive codes, capable of identifying the molecules q which 
25 synergise with p for binding to a target molecule (e.g., a protein target). 



DESCRIPTION OF THE INVENTION 

30 The main ingredients of ESACHEL technology are chemical compounds, com- 
prising an oligonucleotidic moiety (typically, a DNA sequence) linked to an 
oligomerization domain [capable of mediating the (homo- or hetero-) dimeri- 
zation, trimerization or tetramerization of the chemical compounds], linked to 
a chemical entity, which may be involved in a specific binding interaction with a 
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target molecule (Figure 2). Part of the sequence of the oligonucleotidic moiety 
will be uniquely associated with the chemical entity (therefore acting as a 
"code"). In many cases, portions of the same oligonucleotide will serve as the 
oligomerization domain and the code. 

5 

Oligonucleotidic moiety: 

The nature and the design of the oligonucleotidic moieties in ESACHEL technology 
is best understood by the description of the "coding" system, provided in the 
sections below and in the Examples. As an introduction, it suffices to say that, by 

10 stably associating chemical entities with a unique oligonucleotidic sequence (e.g., 
a sequence of DNA or DNA analogues), one provides that chemical entity with a 
unique code, which can be "read" in a variety of ways (sequencing, hybridization 
to DNA chips, etc.) and which may be amenable to amplification (e.g., by the use 
of the polymerase chain reaction [PCR]). Furthermore, using ingenious methods 

15 described below, the code of one particular chemical compound may become 
physically linked to the code of other chemical compound(s), when these 
chemical compounds are associated by means of an oligomerization do- 
main. 

20 Oligomerization domains: 

Suitable DNA sequences (capable of heteroduplex, triplex [Strobel SA, Dervan 
PB. Single-site enzymatic cleavage of yeast genomic DNA mediated by triple he- 
lix formation. Nature. (1991) 350:172-174] or quadruplex [Various authors. Is- 
sue of Biopolymers (2000-2001), volume 56 (3)] formation), can be considered 
25 as possible oligomerization domains. 

Alternatively, the use of other self-assembling polypeptides (e.g. amphipathic 
peptide helices such as leucine zippers) could be considered. Many more chemi- 
cal moieties could be considered as mediators of chemically defined, oligomeric 
30 moieties. For instance, complexes of metal atoms with suitable ligands [such as 
dipyridyl or tripyridyl derivatives) could be envisaged. Furthermore, one could 
envisage that a non-covalent interaction would bring together different chemical 
compounds, which could then react with one another and become covalently as- 
sociated. 
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Some practical embodiments of ESACHEL technology: 

In order to illustrate one possible practical embodiment of ESACHEL technology, 
let us consider the following example (depicted in Figure 3). 

5 A number n of different chemical compounds, carrying a reactive moiety (e.g. a 
thiol-reactive maleimido or a iodoacetamido group), are reacted (in separate re- 
actions) with n different DNA oligonucleotides, carrying a reactive moiety (e.g. a 
thiol group at the 3' end). The corresponding pool of n conjugates is indicated in 
the Figure as "pool A". The oligonucleotides of pool A are designed to have: 
10 - one portion of the DNA sequence which can hybridize to compounds of pool B 
(see Figure 3 and comments below) 

- a distinctive DNA sequence for each of the n members of Pool A 

- additional portions of the DNA sequence judiciously designed for "decoding" 
binding combinations (optional; see below in the section about "Decoding" 

15 and in the Examples). 

Similarly, a number m of different chemical compounds, carrying a thiol-reactive 
moiety (e.g., a maleimido or a iodoacetamido group), are reacted (in separate 
reactions) with m different DNA oligonucleotides, carrying a thiol group at the 5' 
20 end. The corresponding pool of m conjugates is indicated in the Figure as "pool 
B". 

The oligonucleotides of pool B are designed to have: 

- one portion of the DNA sequence which can hybridize to compounds of pool A 
25 (see Figure 3) 

- a distinctive DNA sequence for each of the m members of Pool B 

- additional portions of the DNA sequence judiciously designed for "decoding" 
binding combinations (optional). 

30 The partially complementary strands of the DNA conjugates of pool A and pool B 
can easily heterodimerize in solution, with comparable efficiency within the dif- 
ferent n members of Pool A and the m members of Pool B. If suitable stoichi- 
ometric ratios of the compounds of Pool A and Pool B are used, the n different 
types of compounds of Pool A will heterodimerize with the m different types of 
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compounds of Pool B, yielding a combinatorial self-assembled chemical library of 
dimension m x n. For example, two libraries of thousands of compounds would 
yield millions of different combinations. Furthermore, the resulting self- 
assembled m x n combinations will carry unique DNA codes, corresponding to 
5 the non-covalent but stable association (heterodimerization) of the DNA code of 
the member of Pool A with the DNA code of the member of Pool B. 

As an alternative to the coupling of chemical entities to thiol-bearing oligonucleo- 
tides, a number of standard chemical alternatives can be considered (e.g., oligo- 
10 nucleotides carrying a phosphodiester bond at one extremity, forming chemical 
structures such as -0-P(0) 2 -0-(CH 2 )n-NH-CO-R, where R may correspond to a 
number of different chemical entities, and n may range between 1 and 10). 

Let us assume that one particular member of the library is capable of specific 
15 binding to a target of interest (e.g., a protein immobilized on a bead). Let us also 
assume that both strands A and B contributed to the specific binding interaction 
(for a discussion of the chelate effect which may facilitate this specific binding, 
see below). It should be possible to preferentially enrich this particular combina- 
tion of A and B over the m x n combinations (for example, by exposing the li- 
20 brary to the target protein on the bead, followed by the physical removal of the 
bead from the library solution, followed by judicious washing of the bead, to re- 
duce the amount of non-specific binders on the bead). The analogy of this proce- 
dure to the biopanning procedures used with antibody phage libraries (Viti, 2000) 
should be evident to any person skilled in the art. 

25 

The rescue of the particular combination of A and B, displaying the desired bind- 
ing specificity, can then be followed by the identification of the chemical entities 
responsible for the binding, by identifying the DNA codes of the two strands A 
and B [see later section on "decoding" for a discussion on possible strategies for 
30 the identification of the DNA codes]. 



For a number of applications, at the end of the ESACHEL procedure, it may be 
desirable to link the two chemical entities A and B, responsible for the specific 
binding to the target (Figure 3). One may want to try a variety of different 
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chemical linkers, and assess whether the resulting chemical compounds, derived 
from the chemical entities A and B, display desired molecular properties (e.g., 
high affinity for the target, high specificity for the target, suitable chemical stabil- 
ity, suitable solubility properties, suitable pharmacological properties, etc.). For 
5 example, the length of the chemical linker between A and B will dramatically in- 
fluence he binding properties of the conjugate (for a discussion, see section be- 
low on the chelate effect). 

In the case of Figure 3, a DNA portion is used as heterodimerization domain, and 
10 thioether bond formation is used for the coupling of DNA oligonucleotides to 

chemical entities of the library. However, other oligomerization domains could be 
considered, as well as other chemical avenues for the coupling of chemical enti- 
ties to DNA. 

15 Certain DNA sequences are known to be capable of forming stable trimeric com- 
plexes [Strobel, 1991] or stable tetrameric complexes [Various authors, 2000- 
2001]. For example, Hoogsten pairing of DNA triplexes could allow the self- 
assembly of Pools A x B x C, containing n, m and I members, respectively (Fig- 
ure 4). The tetrameric assembly of DNA-(chemical moiety) conjugates would al- 

20 low even larger library sizes, starting from sub-libraries A, B, C and D of small 
dimension. However, the decoding of binding interactions can, in some cases, be 
more difficult for trimeric and/or tetrameric self-assembled encoded libraries, as 
compared to dimeric libraries. Furthermore, the length and flexibility of the link- 
ers between the DNA strands and the chemical moieties displayed at their ex- 

25 tremity may either facilitate or hinder the identification of specific binding mem- 
bers of the encoded self-assembled chemical (ESACHEL) library. A certain degree 
of flexibility may allow suitable chemical moieties to find complementary pockets 
on the target molecule (Figure 4). On the other hands, the affinity contribution of 
the chelate effect is expected to decrease with linker length. 

30 

It is worth mentioning that, starting from sub-libraries of 100 members, trimeric 
ESACHEL libraries would contain 10 6 members, while tetrameric ESACHEL librar- 
ies would contain 10 8 members. It is easy to calculate the resulting library size, 
starting from sub-libraries of different dimension. The large combinatorial com- 



WO 03/076943 



- 17 - 



PCT/EP02/04153 



plexity of encoded self-assembling chemical compounds may allow the identifica- 
tion of specific binding members, which have so far escaped identification using 
conventional combinatorial chemical methods. An analogy can be drawn from the 
field of antibody phage technology, where it was demonstrated that library size 
5 plays a crucial role in the isolation of high-affinity antibodies. 

ESACHEL codes and decoding methods: 

In ESACHEL technology, unique oligonucleotidic sequences (typically, DNA se- 
quences) provide chemical entities with a unique code. How many different se- 
10 quences do we need, in order to identify members of a library? 

As mentioned above, the key components of ESACHEL technology are chemical 
compounds, comprising an oligonucleotidic moiety (typically, a DNA se- 
quence) linked to an oligomerization domain, in turn linked to a chemical 

15 entity. In most cases, the oligonucleotidic moiety will also provide the oligomeri- 
zation domains. As a consequence, in most cases, ESACHEL components will be 
chemical entities coupled to judiciously chosen DNA oligonucleotides. Typically, 
such oligonucleotides will contain a constant part and a variable part (uniquely 
characteristic for each member of the library). 

20 

Let us consider, as an example, the case illustrated in Figure 3, and discussed in 
the section "Some practical embodiments of ESACHEL technology" (see above). 
In this example, a sub-library "A" (containing n compounds attached at the 3' 
extremity of DNA oligonucleotides) is assembled to a sub-library "B" (containing 

25 m compounds attached at the 5' extremity of oligonucleotides). Sub-library A can 
be represented by a DNA sequence of x bases, where 4 X is greater or equal to n. 
Sub-library B can be represented by a DNA sequence of y bases, where 4 y is 
greater or equal to m. In most cases (see below), identification of the code of 
sub-library members also provides information about which sub-library a par- 

30 ticular code (and therefore a particular compound) belongs to. 



Many methods of "decoding" the ESACHEL codes could be envisaged. Below, we 
illustrate three, which correspond to different experimental requirements and 
which demonstrate the flexibility of ESACHEL technology. 
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Let us consider for simplicity the ESACHEL embodiment of Figure 3. A convenient 
design of oligonucleotides, on which sub-libraries A and B are based, is depicted 
in Figure 5. The oligonucleotides of sub-library A bear chemical entities at the 3' 
end. Towards the 3' extremity, the DNA sequence is designed to hybridize to the 
5 DNA sequences at the 5' extremity of oligonucleotides of sub-library B. The hy- 
bridization region is interrupted by a small segment. In sub-library A, this small 
segment is conveniently composed of phosphodiester backbone without bases 
(termed d-spacer in the Figure); in sub-library B, the corresponding short seg- 
ment will have unique sequence for each member of the sub-library (therefore 
10 acting as "code" for the sub-library B). By contrast, oligonucleotides of sub- 
library A have their distinctive code towards the 5' extremity. 

After biopanning, it is desirable to learn about the code of the binding members 
displaying a desired binding specificity. Oligonucleotides of sub-library B remain 
15 stably annealed to oligonucleotides of sub-library A, and can work as primers for 
a DNA polymerase reaction on the template A. The resulting DNA segment, car- 
rying both code A and code B, can be amplified (typically by PCR), using primers 
which hybridize at the constant extremities of the DNA segment (Figure 5). 

20 If several specific binding members are isolated at the end of a biopanning ex- 
periment, several PCR products will be generated with the process illustrated in 
Figure 5. These products will have similar sequences, except for the regions 
coding for A and B sub-library members. In order to learn more about the rela- 
tive abundance of the different specific binding members, it will be convenient to 

25 create concatenamers, starting from the various PCR products in the reaction 
mixture. Such concatenated sequences can be "read" by sequencing, revealing 
both the identity and the frequency of pairs of code A and code B (which uniquely 
correspond to particular library members). 

30 An alternative decoding strategy is depicted in Figure 6. Sub-libraries A and B 

carry chemical moieties at the extremities of partially-annealing oligonucleotides. 
In most cases, the DNA portion forming a heteroduplex will be constant within 
the library. Conversely, the other extremities will be designed in a way that het- 
eroduplex formation is disfavored. Such unpaired DNA strands will be available 
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for hybridization with target oligonucleotides (for example, DNA oligonucleotides 
immobilized on one or more chips). For example, chip A will allow the reading of 
the identity (and frequency) of members of sub-library A, rescued after a 
biopanning experiment. Similarly, chip B will allow the reading of the identity 
5 (and frequency) of members of sub-library B. A variety of strategies can be con- 
sidered (e.g., DNA radiolabeling, DNA biotinylation followed by detection with 
streptavidin-based reagents, etc.) for the visualization of the binding reaction on 
the chip. 

10 In a first step, the decoding method of Figure 6 will not provide information about 
the pairing of code A and code B within specific binding members. However, de- 
coding on chip A and B will suggest candidate components of sub-libraries A and 
B, to be re-annealed and screened in a successive round of bio-panning. In- 
creasingly stringent binding to the target will be mirrored by a continuous reduc- 

15 tion in the number of A and B members, identified on the chip. Ultimately, the 
possible combinations of the candidate A and B members will be assembled indi- 
vidually (or in smaller pools), and assayed for binding to the target. We refer to 
this iterative strategy as deconvolution. 

20 Obviously, the decoding method of Figure 6 is valid also for ESACHEL, when li- 
braries self-assemble to form trimeric or tetrameric complexes (e.g. using DNA 
triplexes or quadruplexes for the oligomerization of compounds). In these cases, 
3 or 4 chips may be used, respectively, which carry distinctive target oligonu- 
cleotides for decoding. 

25 

If appropriate, the DNA of selected binding moieties of Figure 6 may be PCR am- 
plified prior to chip hybridization. In this case, oligonucleotide design will resem- 
ble the one described in the next paragraph (see also Figure 7). 

30 Another possible decoding method is illustrated in Figure 7. Sub-libraries A and B 
form a heteroduplex, flanked by unique sequences coding for the different library 
members and by constant DNA segments at the termini. After biopanning, suit- 
able pairs of primers allow the PCR amplification of the two strands, yielding PCR 
products whose sequence can be identified using standard methods (e.g. by con- 
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catenation of the PCR products, followed by subcloning and sequencing. Similar 
to the chip-based method illustrated in Figure 6, the method of Figure 7 will not 
provide, in general, direct information about pairing of code A and code B in spe- 
cific binding members. However, (similar to what described for Figure 6), a de- 
5 convolution procedure may be applied (consisting of one or more rounds of pan- 
ning, followed by sequencing and by the choice of a restricted set of sub-library 
components for the next ESACHEL screening), restricting the number of candi- 
date ESACHEL members capable of giving specific binders after self-assembly. 

10 Library construction: 

ESACHEL library construction is facilitated not only by the large dimension that 
can be achieved by self-assembly of sub-libraries, but also by the facile genera- 
tion and purification of chemical derivatives of DNA oligonucleotides. 

15 As mentioned above, DNA oligonucleotides, bearing a thiol group at their 3' or 5' 
end, can be purchased from a variety of commercial suppliers. The chemistry of 
the modification of thiol groups with reagents bearing reactive groups such as 
iodoacetamido moieties or maleimido moieties is well-established [see for exam- 
ple the molecular Probes catalogue at www.probes.com]. Furthermore, several 

20 methods are available in the literature for the chemical modification of 3' or 5' 
extremities of DNA oligonucleotides, for example during solid phase synthesis 
procedures. 

Chemical derivatives of DNA (or some DNA analogues) have the characteristic 
25 property of being highly negatively charged at neutral pH. This facilitates the de- 
velopment of general purification strategies of the DNA derivatives. For example, 
anion exchange chromatography allows the non-covalent (but stable) immobili- 
zation of DNA oligonucleotides (and their derivatives) on a resin, while other 
components of a reaction mixtures can be washed away. DNA derivatives can 
30 then be eluted by buffer change. Alternatively, other purification methods (e.g. 
reverse phase chromatography, hydrophobic interaction chromatography, hy- 
droxyapatite chromatography etc.) could be considered. 
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The availability of generally applicable purification procedures for DNA derivatives 
makes the synthesis of ESACHEL components amenable to robotization [for ex- 
ample using a TECAN Genesys 200-based workstation (TECAN, Mannedorf, Swit- 
zerland), equipped with liquid handling system and a robotic manipulation arm]. 
5 Robotization may be necessary, in order to create ESACHEL sub-libraries con- 
taining several hundred different compounds. 

The methodologies described in this Patent work not only for small organic mole- 
cules, but also for peptides and oligomeric proteins [e.g. antibody Fv fragments, 
10 consisting of a VH and VL domain; see Example 1]. Indeed, the attachment of a 
DNA heteroduplex at the C-terminus of cysteine-tagged VH and VL domains will 
provide an extra stabilization to the Fv heterodimer. 

Biopanning experiments: 

15 The use of ESACHEL for the identification of specific binders relies on the incuba- 
tion of ESACHEL components with the target molecule (e.g., a protein of phar- 
macological interest), followed by the physical separation of the resulting com- 
plex from the ESACHEL components which have not bound to the target. 
In this respect, ESACHEL biopanning experiments are analogous to biopanning 

20 experiments which can be performed with phage libraries and/or ribosome dis- 
play libraries, for which an extensive literature and several experimental proto- 
cols are available [Winter, 1994;; Viti, 2000; Schaffitzel, 1999]. For example, 
physical separation of the complex between ESACHEL members and the target 
molecule, from the pool of non-bound ESACHEL members, could be achieved by 

25 immobilizing the target molecule of a solid support (e.g. a plastic tube, a resin, 
magnetic beads, etc.). 

The chelate effect: 

Some of the contributions of ESACHEL technology for the identification of specific 
30 binders are related to a chemical process, termed the "chelate effect". The term 
chelate was first applied in 1920 by Sir Gilbert T. Morgan and H.D.K. Drew [J. 
Chem. Soc, 1920, 117, 1456], who stated: "The adjective chelate, derived from 
the great claw or chela (chely- Greek) of the lobster or other crustaceans, is sug- 
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gested for the caliper-like groups which function as two associating units and 
fasten to the central atom so as to produce heterocyclic rings." 

The chelate effect can be seen by comparing the reaction of a chelating ligand 
5 and a metal ion with the corresponding reaction involving comparable monoden- 
tate ligands. For example, comparison of the binding of 2,2'-bipyridine with pyri- 
dine or 1,2-diaminoethane (ethylenediamine) with ammonia. It has been known 
for many years that a comparison of this type always shows that the complex 
resulting from coordination with the chelating ligand is much more thermody- 
10 namically stable. 

Let us consider the dissociation steps of a monodentate ligand, compared to 
multidentate (e.g., bidentate ligands). When a monodentate group is displaced, it 
is lost into the bulk of the solution. On the other hand, if one end of a bidentate 
15 group is displaced the other arm is still attached and it is only a matter of the 
arm rotating around and it can be reattached again (Figure 8). In general, the 
formation of the complex with bidentate groups is favored, compared to the 
complex with the corresponding monodentate groups. 

20 The chelate effect has been shown to be able to contribute to high-affinity bind- 
ing not only in the case of multidentate metal ligands, but in many other chemi- 
cal situations, including binding interactions with macromolecules (e.g., multi- 
dentate DNA binding, chelating recombinant antibodies) [Neri D, Momo M, Pros- 
pero T, Winter G. High-affinity antigen binding by chelating recombinant anti- 

25 bodies (CRAbs). J Mol Biol. (1995) 246:367-73]. 

When examining some ESACHEL embodiments, for example those in which two 
chemical moieties are oligomerized by means of DNA heteroduplex formation, it 
is useful to illustrate the chelate effect in the context of the stability of the DNA 
30 heteroduplex which bridges the two chemical entities involved in the specific 
binding interaction with a target. In most cases, it will be convenient to have 
heteroduplexes (or triplexes or quadruplexes) which de facto do not dissociate in 
the experimental conditions chosen for the ESACHEL biopanning. Useful informa- 
tion and a discussion on the energetics of cooperative binding with short DNA 
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heteroduplex fragments (8 bp) can be found in Distefano and Dervan, 1993 
[Distefano MD, Dervan PB. Energetics of cooperative binding of oligonucleotides 
with discrete dimerization domains to DNA by triple helix formation Proc Natl 
Acad Sci USA. (1993) 90: 1179-1183.]. 

5 

Considerations on the procedures following ESACHEL biopanning: 

What happens after ESACHEL experiments, when specific binding members have 
been identified? For some purposes (e.g., certain biochemical experiments), it 
may be conceivable to use ESACHEL members without further chemical trans- 
10 formations. For example, one may want to measure binding affinities and kinetic 
constants for the binding of ESACHEL members to a target molecule. 

For many applications, however, one may want to covert ESACHEL self- 
assembled molecules into analogues, in which the chemical entities responsible 
15 for the binding are covalently linked. The length, rigidity, stereoelectronic chemi- 
cal properties and solubility of the linker will influence the binding affinity and 
performance of the resulting molecule [Shuker SB, Hajduk PJ, Meadows RP, Fesik 
SW. Discovering High-Affinity Ligands For Proteins - Sar by Nmr. Science (1996) 
274:1531-1534] (see also Example 4). 

20 

EXAMPLES 

Example 1: 

25 As mentioned in previous sections, one strength of ESACHEL technology is its 

compatibility with a variety of different chemical moieties, including peptides and 
globular proteins (e.g., antibody domains). 

In this example, we show how a simple embodiment of ESACHEL (Figure 1), 
30 featuring cysteine-tagged antibody variable domains covalently linked to DNA 
oligonucleotides capable of partial heteroduplex formation, leads to the identifi- 
cation of a pair of variable heavy domain (VH) and variable light domain (VL), 
which yield a specific antigen binding after heterodimerization. 
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The genes of the VH and VL domains of the L19 antibody (specific for the ED-B 
domain of fibronectin [Pini A, Viti F, Santucci A, Carnemolla B, Zardi L, Neri P, 
Neri D. Design and use of a phage display library. Human antibodies with sub- 
nanomolar affinity against a marker of angiogenesis eluted from a two-dimen- 
5 sional gel. J Biol Chem. (1998) 273:21769-21776]), of the HyHEL-10 antibody 
(specific for hen egg lysozyme [Neri, 1995]; please note that an internal EcoRI 
site had previously been mutagenized without altering the protein sequence) and 
of other antibodies isolated from the ETH-2 library (Viti, 2000), are PCR amplified 
using the following pairs of primers, which code for a cysteine residue, appended 
10 at the C-terminal extremity of each V domain: 

L19 and ETH-2: 

L19VH_Eco_fo 

1 5 TTT CAC ACA GAA TTC ATT AAA GAG GAG AAA TTA ACT ATG GAG GTG CAG CTG 
TTG GAG TCT 

L19VH_Hind_ba 

TCA ATC TGA TTA AGC TTA GTG ATG GTG ATG GTG ATG ACA TCC ACC ACT CGA 
20 GAC GGT GAC CAG GGT 

L19VL_Eco_fo 

TTT CAC ACA GAA TTC ATT AAA GAG GAG AAA TTA ACT ATG GAA ATT GTG TTG 
ACG CAG TCT CCA 

25 

L19VL_Hind_ba 

TCA ATC TGA TTA AGC TTA GTG ATG GTG ATG GTG ATG ACA TCC ACC TTT GAT 
TTC CAC CTT GGT CCC TTG 

30 HyHEL-10: 

HH10VH_Eco_fo 

TTT CAC ACA GAA TTC ATT AAA GAG GAG AAA TTA ACT ATG GAG GTG AAG CTG 
CAG CAG TCT 
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HH10VH_Hind_ba 

TCA ATC TGA TTA AGC TTA GTG ATG GTG ATG GTG ATG ACA TCC ACC TGC AGA 
GAC AGT GAC CAG AGT 

5 HH10VL_Eco_fo 

TTT CAC ACA GAA TTC ATT AAA GAG GAG AAA TTA ACT ATG GAT ATT GTG CTA 
ACT CAG TCT CCA 

HH10VL_Hind_ba 

10 TCA ATC TGA TTA AGC TTA GTG ATG GTG ATG GTG ATG ACA TCC ACC TTT TAT 
TTC CAG CTT GGT CCC CCC 

The resulting PCR products are subcloned, using standard molecular biology pro- 
cedures, into the EcoRI/Hindlll sites of plasmid pQE12 (Qiagen, Germany). The 
15 resulting plasmids code for V domains, which carry the following sequence at 
their C-terminus: -Gly-Gly-Cys-His-His-His-His-His-His. 

The plasmids, encoding the cysteine-tagged V-domains, are electroporated into 
E.coli cells (preferentially, in the Origami strain of Novagen, which have a slightly 
20 oxidizing cytoplasmic redox potential), expressed and purified, using metal che- 
late chromatography on NiNTA resin (Qiagen, Germany). 

The cysteine-tagged V domains are reduced with 1 mM dithiothreitol solution in 
PBS (50 mM phosphate buffer + 100 mM NaCI, pH = 7.4), followed by desalting 
25 on a PD-10 column (Amersham-Pharmacia, Dubendorf, Switzerland). 

In parallel, different oligonucleotides, carrying a thiol group at the 3' end or at 
the 5' end, are ordered from a commercial supplier (e.g., Microsynth, Balgach, 
Switzerland). Individual DNA oligonucleotides with the thiol group at the 3' end 
30 are used for coupling to individual VH domains. Individual DNA oligonucleotides 
with the thiol group at the 5' end are used for coupling to individual VL domains. 

Representative sequence types are illustrated below. Please note that oligonu- 
cleotides of these families are capable of partial heteroduplex formation: 
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L19: 

L19_5SH 

5-HS-GGA GCT TCT GAA TTC TGT GTG CTG CAT AAT CGA CAC GAA TTC CGC 
5 AGC-3' 

L19_3SH 

5'-TCG CGA GGG GAA TTC GTC ATA TAT CAG CAC ACA GAA TTC AGA AGC TCC- 
SH-3' 

10 

HyHEL-10: 
HyHellO_5SH 

5-HS-GGA GCT TCT GAA TTC TGT GTG CTG CAG TGG CGA CAC GAA TTC CGC 
15 AGC-3' 

HyHellO_3SH 

5'-TCG CGA GGG GAA TTC GTC ATA GGG CAG CAC ACA GAA TTC AGA AGC TCC- 
SH-3' 

20 

Anti-GST (from ETH-2 library): 
GST_5SH 

5-HS-GGA GCT TCT GAA TTC TGT GTG CTG CTG AGG CGA CAC GAA TTC CGC 
25 AGC-3' 

GST_3SH 

5'-TCG CGA GGG GAA TTC GTC AAG AGG CAG CAC ACA GAA TTC AGA AGC TCC- 
SH-3' 

30 

In parallel reactions, the purified thiol-containing DNA oligonuclotides are reacted 
with a molar excess of bismaleimido-hexane (Pierce, Belgium) in PBS + DMSO, 
following the manufacturer's instructions. The resulting derivatives are purified 
from unreacted bismaleimido-hexane using anion exchange chromatography, 
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then reacted with slight molar excess of purified VH-cys or VL-cys, respectively, 
at a domain concentration > 0,1 mg/ml. The resulting (V domain)-DNA reaction 
products are separated from unreacted V-domain by anion exchange chroma- 
tography. 

5 

An equimolar mixture of (V domain)-DNA derivatives is mixed in PBS, heated at 
70 degrees centigrade for 1 minute, then let equilibrate until it reaches room 
temperature. The resulting mixture of ESACHEL compounds is then incubated 
with a 0.1 |jM solution of biotin-ED-B in PBS at room temperature for 10 minutes 
10 (Pini, 1998), then captured on streptavidin-coated magnetic beads and washed 
extensively according to standard procedures. 

The resulting bead preparation is then used as template for two separate PCR 
reactions, which amplify the (L19_5SH, HyHellO_5SH, GST_5SH) and (L19_3SH, 
15 HyHellO__3SH, GST_3SH) oligonucleotides (see above), using oligos: 

lAB_PCRfo 

5'-GGA GCT TCT GAA TTC TGT GTG CTG-3' 

20 lAPCRba 

5'-GCT GCG GAA TTC GTG TCG-3' 

lB_PCRba 

5'-TCG CGA GGG GAA TTC GTC-3'. 

25 

The resulting PCR products are digested with EcoRl, ligated to form concatenam- 
ers, subcloned into a suitable host plasmid, followed by electroporation in E.coli 
and sequencing. The resulting sequence analysis shows a strong bias towards 
L19 codes (CAT AAT and ATA TAT) over HyHEL-10 and GST codes, indicating a 
30 preferential enrichment of VH(L19)-VL(L19) combinations over the other possible 
assembly products. 
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Example 2: 

In this example, we describe how the ESACHEL embodiment of Figure 1 can be 
performed in a practical implementation. The experimental strategy outlined here 
is also applicable to the embodiments described in Figure 4, in which DNA tri- 
5 plexes or DNA quadruplexes are used to display chemical entities at the extrem- 
ity of self-assembling oligonucleotides. 

Two sub-libraries are constructed as follows: 

A sub-library "A" is created, by coupling n compounds to the 3' extremity of n 
10 different DNA oligonucleotides. Among the many different possible implementa- 
tions, a convenient one is represented by the coupling of iodoacetamido- or 
maleimido-derivatives of n chemical entities to individual DNA oligonucleotides, 
which carry a thiol group at the 3' end. The coupling can easily be performed at 
room temperature in PBS (50 mM phosphate buffer + 100 mM NaCI, pH = 7.4), 
15 by simple mixing of the thiol-bearing oligonucleotide (typical concentration 
range: 10 - 100 pM) with a molar excess of iodoacetamido- or maleimido- 
derivative (typical concentration range: 50 - 500 pM), followed by chroma- 
tographic purification of the DNA-chemical entity adduct. The thiol-containing 
oligonucleotides can be purchased from commercial suppliers. Each of them 
20 contains a constant sequence portion 

(e.g., 5'-XXXXXCAGCACACAGAATTCAGAAGCTCC-3') capable of heteroduplex 
formation with members of sub-library B (see below). The DNA sequence portion 
XXXXX at the 5' end is (at least in part) different in each member of the sub-l- 
ibrary A, therefore acting as a code. 

25 

Similarly, a sub-library "B" is created, by coupling m compounds to the 5' ex- 
tremity of m different DNA oligonucleotides. Coupling of iodoacetamido- or 
maleimido-derivatives of m chemical entities to individual DNA oligonucleotides, 
which carry a thiol group at the 5' end, is performed similar to what described for 
30 sub-library "A". Such oligonucleotides can be purchased from commercial suppli- 
ers. Each of them contains a constant sequence portion 

(e.g., 5 '- G GAG CTTCTG AATTCTGTGTG CTG YYYYY- 3 ') capable of heteroduplex for- 
mation with members of sub-library A (see above). The DNA sequence portion 
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YYYYY at the 3' end is (at least in part) different in each member of the sub-l- 
ibrary B, therefore acting as a code. 

Assembly of sub-library A members with sub-library B members is carried out by 
5 mixing the sub-libraries in PBS, heating the mixture at 70 degrees centigrade for 
1 minute (if compatible with the stability of the chemical entities used in sub- 
library construction), followed by equilibration at room temperature. The result- 
ing ESACHEL library contains n x m members, and can be used in biopanning 
experiments, followed by decoding of the binding members. 

10 

Example 3: 

This example illustrates one of the many possible decoding methodologies, for 
ESACHEL embodiments as described in Figure 1 and in Example 2. 

15 

The decoding strategy, schematically depicted in Figure 5, is based on the princi- 
ple that, after biopanning of desired ESACHEL binding specificities, PCR frag- 
ments are generated, each of which carries the code of pairs of sub-library 
members, whose combination was rescued in the biopanning experiment, there- 
20 fore allowing the identification of the corresponding heterodimerized chemical 
entities. 

Chemical entities of sub-libraries A and B (see also Figure 1 and Example 2) are 
coupled, individually, to members of two pools of DNA oligonucleotides with the 
25 following properties: 

- One pool of oligonucleotides carries the chemical entities at the 3'-end (pool 
A), whereas the other pool carries the chemical entity at the 5'-end (pool B). 

- A sufficient number of bases at the 5' extremity of oligonucleotides of pool B 
allow the specific dimerization of any individual member of pool B with any in- 

30 dividual member of pool A. Inside this dimerization domain, oligonucleotides 

from pool B contain a "code" region, which codes for the chemical entity at 
the 5'-end. Oligonucleotides of pool A contain a sufficient number of desoxyri- 
bose backbone elements without bases (d-Spacer), to prevent any undesired 
pairing to the bases of code B. 
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- Oligonucleotides of sub-library A have their distinctive code towards the 5' 
extremity. 

Oligonucleotides of sub-library B remain stably annealed to oligonucleotides of 
5 sub-library A, and can work as primers for a DNA polymerase reaction on the 

template A. The resulting DNA segment, carrying both code A and code B, can be 
amplified (typically by PCR), using primers which hybridize at the constant ex- 
tremities of the DNA segment (Figure 5). 

10 As an example of model oligonucleotides A and B which can be used for the gen- 
eration of a PCR product, which carries both code A and B, is provided below: 

typeB__oligo 

Chemical entity B - 5'-GCA TAC CGG AAT TCC CAG CAT AAT GAT CGC TAT CGC 
15 TGC-3' 

typeA_oligo (d=d-Spacer element) 

5'-CGT CAG CTC GAA TTC TCC ATA TAT GCA GCG ATA GCG ATC DDD DDD CTG 
GGA ATT CCG GTA TGC -3' - chemical entity A 

20 

CodeABfo 

5'-GCA TAC CGG AAT TCC CAG-3' 

CodeABba 

25 5'-CGT CAG CTC GAA TTC TCC-3' 

typeA_oligo and type_B oligo are mixed in approx. equimolar amounts. The re- 
sulting mixture is heated up to 70°C, and cooled to room temperature, allowing 
the heterodimerization of typeA_oligo and type_B oligo. The resulting mixture is 
30 mixed with a suitable buffer for PCR reaction, dNTPs (250 pM per nucleotide, 

Pharmacia). Taq-Polymerase (1U, Appligen) is then added, and followed by incu- 
bation of the mixture at 40°C for 5 minutes. Next, a PCR program with 30 cycles 
of (90°/l minute)-(50°C/l minute)- (72°C/15 seconds) is started after addition 
of primers CodeABfo and CodeABba (400pM). After completion of the program, 
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the length of the PCR fragment is checked by standard polyacrylamide gel meth- 
odology, using commercial Novex gels. Its sequence identity can be established 
by digesting the product with EcoRI, followed by cloning into a suitable plasmid 
and sequencing. 



Example 4: 

Like chelating recombinant antibodies (CRAbs) [Neri, 1995] and small organic 
ligands identified using SAR by NMR [Shuker, 1996], the high-affinity binding of 
10 ESACHEL members to target molecules may rely on the chelate effect. 

It is intuitive that the affinity gain contribution of the chelate effect will depend 
on the length, rigidity, stereoelectronic chemical properties and stability of the 
linkage between the two (or more) chemical entities, in contact with the target 
15 antigen. Furthermore, the affinity gain will directly depend on the magnitude of 
the association and dissociation rate constants (k on and k off ) of the individual 
chemical entities, binding to the target. 

In this example, we present a computational model, which provides information 
20 about the contribution of the chelate effect, in relation to the above mentioned 
parameters (linker length, k on and k 0 ff). 

As depicted in Figure 9, two different chemical entities A and B bind to distinct 
binding sites of the same target molecule, and are connected by a linker of de- 
25 fined length. It is convenient to define four different states (nl, nil, nlll and 
nIV), which may interconvert by means of chemical binding equilibria: 



5 



30 



nIV: 



nl: 



nil: 



nlll: 



Both A and B bound to their binding pocket 

A bound to its binding pocket, while B is not bound to its binding 
pocket 

B bound to its binding pocket, while A is not bound to its binding 
pocket 

Both A and B not bound to their binding pockets 
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The kinetic parameters k on A, k offA/ k onB and k of f B , describing the binding properties 
of the individual chemical entities A and B to the corresponding binding pockets, 
are known. From these constants, it is possible to determine probabilities for a 
bound molecule to go off the binding pocket (poff), and for an unbound molecule 
to bind to its binding pocket (pon) in a defined time increment. 

poff=k oJf .M [1] 
In first order kinetics, the half life of binding can be expressed as: 



If at time t=0 all molecules B are not bound to the corresponding binding pocket 
(and if one neglects in a first approximation dissociation processes), the fraction 
15 of bound molecules after the time increment At can be expressed as follows: 

J*$^ = e -"*~W [3] 



10 



N(t = 6) 



If one chooses a sufficiently large ensamble of molecules, the equation [3] can 
20 be approximated to the probability that a molecule B binds to its pocket in the 
time increment At. 

The equations written so far correspond to chemical entities A and B, which bind 
to the corresponding pockets independent from one another. Let us assume, 
25 however, that A and B are connected by a linker, and that moiety A is bound to 
its target. It is convenient to express the concentration of B in the vicinity of its 
target binding pocket as effective concentration, ec. 



pon = e on L4J 



30 
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In our model, the contribution of the chelate effect to the binding properties of 
the A-B bidentate molecule to the target is due to an increase of the effective 
concentration of one of the two binding moieties, when the other one is bound to 
its binding pocket (Figure 9). In a simple model, let us assume that, if binding 
5 molecule A is bound, the molecule B can be situated with equal likelihood in 
every position in a half spherical space around molecule A, whereby the radius 
"rad" (measured in meters) is equal to the linker length. Sterical constraints of 
the linker, repulsion effects etc. are neglected in this simple model. The same 
assumption is used when molecule B is bound, and A is unbound. As a result, the 
10 molar effective concentration ec can be computed as: 



Based on these assumptions, we designed a computer program to estimate the 
15 contribution of the chelate effect to the residual half life of a bidentate binding 
molecule A-B, where the two individual moieties A and B binding to two distinct 
binding pockets on the same target molecule are connected with a liker of the 
length rad. The possibility of A and B binding to two different target molecules is 
neglected. 

20 

In a population of n A-B molecules, the four states of Figure 9 can be populated 
by the individual molecules, and it is conceivable that individual molecules are 
found in different states at different times of observation. In our model, we de- 
termined the probabilities pon and poff of the individual molecules A and B to 
25 change their state within a time increment At. 

As a practical example, let us consider that at time t=0, all molecules A-B are in 
state nl (both A and B are bound). At every time increment At (which is 1 second 
in the program) the probabilities of the moieties A and B to change their binding 
30 status give rise to a new distribution of molecules A-B in the four different states. 
In the simulation, irreversible dissociation conditions are used (i.e., dissociated 
molecules A-B, in state nIV are not allowed to bind back to the target). The pro- 




[5] 
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gram repeats these calculations time increment after time increment, until the 
population of molecules A-B bound to the target (sum of nl, nil and nlll) drops 
to half of the starting population. The sum of the time increments gives an esti- 
mate of the half-life of a bound molecule A-B to its target. 

5 

By varying either the initial configuration of the ensamble of molecules, or the 
parameters k 0 ff A , k 0 ff B/ k on A, k on B and rad, one can estimate the contribution to the 
chelate effect of different linked chemical entities, in terms of the kinetic stabili- 
zation of the complex. 

10 

The code of the corresponding CHELATE program (written in PASCAL) is 
listed below: 

-|g *************** 
program chelate; 

var 

n, nO , koffA, koffB, koffAB, konA, konB rdouble; 
20 tl2A, tl2B, tl2AB, rad, cone: double; 
linkerA : integer; 

nl, nil, nlll, nIV, deltal, deltall, deltalll, deltalV : double; 
pAoff, pAon, pBoff, pBon : double; 

25 begin 

writeln; 

writeln ('molecules A and B are connected with a linker '); 
writeln; 

30 writeln ('koffA [s-1] = '); (* type in values *) 

readln (koffA); 
writeln ('konA [M-ls-1] = '); 
readln (konA); 
writeln ('koffB [s-1] = '); 
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readln (koffB); 
writeln ('konB [s-1] = '); 
readln (konB); 
writeln; 

5 writeln ('linker length [A] ='); 
readln (linkerA); 

tl2A: = ln(2)/koffA; (* calculate tl2 and concentration *) 
tl2B:=ln(2)/koffB; 
10 rad: = linkerA*le-10; 

conc: = l/((2/3*Pi*rad*rad*rad)*6.01e26); 

writeln ('koffA=',koffA, 1 tl2A= , / tl2A/ konA=',konA); 
writeln ('koffB=', koffB, 1 tl2B= , ,tl2B/ konB= , ,konB); 
15 writeln ('linker length [m]= , ,rad); 

writeln ('effective concentration [M]=',conc); 
writeln; 

tl2AB:=0; (* Parameters *) 

20 n0: = lel0; 

nl: = n0; nll:=0; nlll:=0; nIV:=0; (*in this embodiment of the program, only 
the nl state is populated at time 0 *) 
n: = nl + nil + nlll; 
pAoff: = koffA; 
25 pAon : = l-exp(- l*konA*conc) ; 
pBoff: = koffB; 

pBon: = l-exp(-l*konB*conc); 

30 

while n>n0/2 do 
begin 

deltal:=0; deltall:=0; deltalll:=0; deltaIV:=0; 
tl2AB:=tl2AB+l; (* one loop equals one second *) 
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(* nl *) 

deltalV :=deltaIV +(nI*pAoff*pBoff); 
5 deltalll: =deltaIII+(nI*pAoff*(l-pBoff)); 
deltall :=deltall +(nI*(l-pAoff)*pBoff); 
deltal :=deltal -(nI*pAoff*pBoff); 
deltal :=deltal -(nI*pAoff*(l-pBoff)); 
deltal :=deltal -(nI*(l-pAoff)*pBoff); 

10 

(* nil *) 

deltalll : =deltaIII+(nII*pAoff*pBon) ; 
deltalV :=deltaIV +(nII*pAoff*(l-pBon)); 
deltal :=deltal +(nII*(l-pAoff)*pBon); 
15 deltall :=deltall -(nII*pAoff*pBon); 

deltall :=deltall -(nII*pAoff*(l-pBon)); 
deltall :=deltall -(nII*(l-pAoff)*pBon); 

(* nlll *) 

20 deltall :=deltall +(nIII*pAon*pBoff); 

deltal :=deltal +(nIII*pAon*(l-pBoff)); 

deltalV :=deltaIV +(nIII*(l-pAon)*pBoff); 

deltalll: =deltaIII-(nIII*pAon*pBoff); 

deltalll: =deltaIII-(nIII*pAon*(l-pBoff)); 
25 deltalll: =deltaIII-(nIII*(l-pAon)*pBoff); 

nl :=nl + deltal; 
nil :=nll + deltall; 
nIII: = nIII + deltalll; 
30 nIV :=nIV + deltalV; 

n: = nl+nll+nlll; 

(* nIV is removed from the remaining population *) 
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end; 
5 writeln; 

writeln ( , koffA= , / koffA / 1 tl2A=',tl2A,' konA=',konA); 
writeln ( , koffB=' / koffB / ' tl2B= , ,tl2B/ konB=',konB); 
writeln ('linker length [m]= l / rad); 
writeln ('effective concentration [M]=',conc); 
10 writeln; 

writeln ( , tl2AB= , / tl2AB/ s'); 
writeln; 

readln; 
15 end. 



20 Example 5: 

In many cases, it may be desirable to improve the affinity of an existing binder 
towards a target molecule (e.g. a pharmaceutical target). Towards, this goal, 
ESACHEL technology can be used as follows, i.e. omitting the "code" oligonucleo- 
25 tide sequence from the binder to be optimized. Let us suppose that the chemical 
moiety p binds to a target molecule (e.g., a protein) with an insufficient affinity. 
It will be convenient to link p to one extremity (e.g., the 5' end) of a suitable oli- 
gonucleotide, capable of self-assembly with other oligonucleotide derivatives 
(typically, by heteroduplex formation, as depicted in Figure 10). 

30 

For example, the chemical moiety p will be coupled to the 5' end of oligonucleo- 
tide 5' - 5'- GGA GCT TCT GAA TTC TGT GTG CTG -3'. It will then be convenient 
to chemically couple, in individual reactions, many different chemical moieties q 
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at the 3' end of oligonucleotides, of general sequence 5' - XX XX - Y - CAG 

CAC ACA GAA TTC AGA AGC TCC - 3', whereas: 

the XX XX portion will be different for the different compounds; 

5 - Y represents a biotinylated base analogue; 

- the sequence 5' - CAG CAC ACA GAA TTC AGA AGC TCC - 3' will be identical 
in all cases, allowing the heteroduplex formation with the sequence 5' - GGA 
GCT TCT GAA TTC TGT GTG CTG - 3', chemically coupled to p, for all mem- 
bers of the ensamble of molecules q. 

10 

The resulting library will pair p with molecules q, each of which bears a distinc- 
tive oligonucleotide code. The self-assembled library can be submitted to biopan- 
ning, under conditions of suitable stringency. The binders rescued at the end of 
the biopanning procedure will be identified by means of their code. For example, 

15 the codes of the molecules q, which together with p give rise to high-affinity 
binders for the target molecule, can be read by hybridization to an oligonucleo- 
tide chip, in which the different positions are covered with oligonucleotides, which 

are complementary to the sequences XX XX of the members of the sub-library 

Q. The biotin moiety on members of sub-library Q will allow the detection of the 

20 binding events on the chip. 

Candidate chemical moieties q will then be chemically linked to p, and the re- 
sulting conjugate will be used as a specific binder for the target molecule of in- 
terest. 



25 
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Claims 

1. A chemical compound comprising a chemical moiety (p) capable of per- 
forming a binding interaction with a target molecule and further comprising 
an oligonucleotide (b) or functional analogue thereof, characterized in 
that the oligonucleotide (b) or functional analogue comprises at least one 
self-assembly sequence (bl) capable of performing a combination reaction 
with at least one self-assembly sequence (bl') of a complementary oligonu- 
cleotide or functional analogue bound to another chemical compound com- 
prising a chemical moiety (q). 

2. The chemical compound of claim 1, characterized in that the oligonucleo- 
tide (b) or functional analogue thereof further comprises a sequence tag 
(b2) coding for the identification of the chemical moiety (p). 

3. A chemical compound comprising a chemical moiety (p) capable of per- 
forming a binding interaction with a target molecule and further comprising 
an oligonucleotide (b) or functional analogue thereof, which comprises a 
coding sequence (bl) coding for the identification of the chemical moiety 
(p), characterized in that the chemical compound further comprises at 
least one self-assembly moiety (m) capable of performing a combination re- 
action with at least one self-assembly moiety (m') of a similar chemical 
compound comprising a chemical moiety (q). 

4. The chemical compound of claim 3, characterized in that the self- 
assembly moiety (m) is a self-assembly sequence (bl) of the oligonucleo- 
tide (b), a functional analogue thereof, a ligand (I) capable to perform a 
complex reaction with a specific ion, or a peptide capable of association with 
other molecules. 

5. The chemical compound of one of claims 1 to 4, characterized in that the 

oligonucleotide (b) or functional analogue thereof is directly linked to chemi- 
cal moiety (p). 
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6. The chemical compound of one of claims 1 to 5, characterized in that the 

oligonucleotide (b) or functional analogue thereof further comprises a link- 
ing portion (b3) which is situated between the self-assembly sequence (bl) 
and the chemical moiety (p). 

5 

7. The chemical compound of one of claims 2 or 5, characterized in that the 
coding sequence (b2) of oligonucleotide (b) or the functional analogue 
thereof is situated between the chemical moiety (p) and the self-assembly 
sequence (bl). 

10 

8. The chemical compound of one of claims 1, 2 or 4 to 7, characterized in 
that the at least one self-assembly sequence (bl) is capable of performing 
a combination reaction with the self-assembly sequences (bl 1 ) of two or 
more complementary oligonucleotides or functional analogues bound to 

15 other chemical compounds comprising a chemical moiety (q,r,s ...) in order 

to form oligomers like e.g. trimers or tetramers. 

9. A library of chemical compounds comprising a chemical moiety (p) capable 
of performing a binding interaction with a target molecule (e.g. a biological 

20 target) and further comprising an oligonucleotide (b) or functional analogue 

thereof, characterized in that the oligonucleotide (b) or functional ana- 
logue comprises at least one self-assembly sequence (bl) capable of per- 
forming a combination reaction with a self-assembly sequence (bl') of a 
complementary oligonucleotide or functional analogue bound to another 

25 chemical compound comprising a chemical moiety (q). 



10. A library of chemical compounds comprising a chemical moiety (p) capable 
of performing a binding interaction with a target molecule (e.g. a biological 
target) and further comprising an oligonucleotide (b) or functional analogue 
30 thereof, which comprises a coding sequence (bl) coding for the identifica- 

tion of the chemical moiety (p), characterized in that the chemical com- 
pound further comprises at least one self-assembly moiety (m) capable of 
performing a combination reaction with at least one self-assembly moiety 
(m') of a similar chemical compound comprising a chemical moiety (q). 
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11. The library of claim 10, characterized in that the self-assembly moiety 
(m) is a self-assembly sequence (bl) of the oligonucleotide (b), a functional 
analogue thereof, or a ligand (I) capable to perform a complex reaction with 
a specific ion (i). 

5 

12. The library according to one of claims 9 to 11, characterized in that it 

comprises chemical compounds according to any one of claims 2 or 5 to 8. 

13. The library according to any of claims 9, 11 or 12, characterized in that 

10 its individual combinations of moieties (p,q,r,s ...) is derived by forming oli- 

gomers like e.g. trimers or tetramers, i.e. by heterooligomerization of the 
self-assembly sequences (bl,bl') of the oligonucleotide (b) forming hetero- 
duplexes, heterotriplexes or heteroquadruplexes. 

15 14. The library according to claim 10, characterized in that its individual com- 
binations of moieties (p,q,r,s ...) is derived by chelation of the self-assembly 
moieties (m,m f ) with specific ions (i). 

15. The library according to claim 13, characterized in that it comprises indi- 
20 vidually encoded sub-libraries (A) and (B), whereas sub-library (A) com- 

prises n compounds coupled to the 3' extremity of n different DNA oligonu- 
cleotides (b) and sub-library (B) comprises m compounds coupled to the 5' 
extremity of m different DNA oligonucleotides (b'). 



25 16. The library according to claim 15, characterized in that in sub-library (A) 
or in sub-library (B) respectively, iodoacetamido- or maleido-derivatives of 
n or m chemical entities have been coupled to individual DNA oligonucleo- 
tides which carry a thiol group at the 3' or 5' end. 



30 17. The library according to claim 15, characterized in that in sub-library (A) 
or in sub-library (B) respectively, amide derivatives - forming chemical 
structures such as -0-P(0) 2 -0-(CH 2 )n-NH-CO-R, where R may correspond 
to a number of different chemical entities, and n may range between 1 and 
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10 - have been coupled to the oligonucleotides carrying a phosphodiester 
bond at one extremity. 

18. The library according to one of claims 15 to 17, characterized in that in 
sub-library (A) the self-assembly sequence (bl) is interrupted by a d-spacer 
in opposite position to a code (B), the d-spacer preventing any undesired 
pairing to the bases of code (B) which encodes sub-library (B), whereas the 
oligonucleotide (b) of sub-library (A) has its distinctive code (A) towards the 
5' extremity. 

19. A method of biopanning ligands specific for target molecules (e.g. biological 
targets), characterized in that a chemical compound comprising a chemi- 
cal moiety (p) capable of performing a binding interaction with a target 
molecule and further comprising an oligonucleotide (b) or functional ana- 
logue thereof, wherein the oligonucleotide (b) or functional analogue com- 
prises at least one self-assembly sequence (bl) capable of performing a 
combination reaction with at least one self-assembly sequence (bl 1 ) of a 
complementary oligonucleotide or functional analogue bound to another 
chemical compound comprising a chemical moiety (q) is incubated with a 
target molecule and the resulting complex of the target and a chemical 
compound is physically separated from chemical compounds which have not 
bound to the target. 

20. The method of claim 19, characterized in that a chemical compound ac- 
cording to at least one of claims 1 to 8 is used for biopanning. 

21. The method of claim 19 or 20, characterized in that a library of chemical 
compounds according to at least one of claims 9 to 18 is used for biopan- 
ning. 

22. A method to identify a target molecule (e.g. a biological target) with a 
chemical compound comprising a chemical moiety (p) capable of performing 
a binding interaction with this target molecule and further comprising an oli- 
gonucleotide (b) or functional analogue thereof characterized in that the 
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chemical compound is bound to a target by biopanning according to at least 
one of claims 19 to 21. 

23. The method of claim 22, characterized in that PCR-fragments are gener- 
ated by polymerase chain reaction (PCR), each of which carries the code of 
pairs of sub-library members (A) and (B), whereas sub-library (A) com- 
prises n compounds coupled to the 3' extremity of n different DNA oligonu- 
cleotides (b) and sub-library (B) comprises m compounds coupled to the 5' 
extremity of m different DNA oligonucleotides (b 1 ). 

24. The method of claim 23, characterized in that in sub-library (A) or in sub- 
library (B) respectively, iodoacetamido- or maleido-derivatives of n or m 
chemical entities are coupled to individual DNA oligonucleotides which carry 
a thiol group at the 3' or 5' end. 

25. The method of claim 24, characterized in that in sub-library (A) the self- 
assembly sequence (bl) is interrupted by a d-spacer in opposite position to 
a code (B), the d-spacer preventing any undesired pairing to the bases of 
code (B) which encodes sub-library (B), whereas the oligonucleotide (b) of 
sub-library (A) has its distinctive code (A) towards the 5' extremity. 

26. The method of at least one of claims 22 to 25, characterized in that the 
length of the PCR-fragments are checked and their sequence identity is es- 
tablished by digesting the PCR-fragments with a restriction site for a specific 
endopeptidase (e.g. EcoRI), followed by cloning into a suitable plasmid and 
sequencing. 

27. The method of at least one of claims 22 to 26 where several specific binding 
members are isolated at the end of a biopanning experiment, character- 
ized in that concatenamers are created, starting from the various PCR- 
fragments present in the reaction mixture, the concatenated sequences are 
"read" by sequencing, revealing both the identity and the frequency of pairs 
of code (A) and code (B). 
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28. The method of claim 23 where several specific binding members are isolated 
at the end of a biopanning experiment and sub-libraries (A) and/or (B) carry 
chemical moieties at the extremities of partially-annealing oligonucleotides 
characterized in that unpaired DNA strands are hybridized with target oli- 

5 gonucleotides (e.g. DNA oligonucleotides) being immobilized on one or more 

chips. 

29. The method of claim 28, characterized in that by using chip (A) or chip 
(B) respectively, the reading of the identity and/or frequency of members of 

10 sub-library (A) or sub-library (B) respectively, rescued after a biopanning 

experiment, is carried out and by decoding on chip (A) and (B) candidate 
components of sub-libraries (A) and (B), to be re-annealed and screened in 
a successive round of bio-panning are suggested. 

15 30. The method of claim 29, characterized in that increasingly stringent 

binding to the target is mirrored by a reduction in the number of (A) and/or 
(B) members as identified on the respective chip and the possible combina- 
tions of the candidate (A) and (B) members are assembled individually or in 
smaller pools and assayed for binding to the target. 

20 

31. The method of at least one of the claims 28 to 30, characterized in that 
libraries are allowed to self-assemble in order to form trimeric or tetrameric 
complexes (e.g. using DNA triplexes or quadruplexes for the oligomerization 
of compounds) by using three or four chips, respectively, which carry dis- 

25 tinctive target oligonucleotides for decoding. 

32. The method of at least one of the claims 38 to 31, characterized in that 
the DNA of selected binding moieties is PCR amplified prior to chip hybridi- 
zation. 

30 
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AMENDED CLAIMS 

[received by the International Bureau on 14 July 2003 (14.07.03); 
original claims 1-32 ammended ] 

1. A chemical compound comprising a chemical moiety (p) potentially capable 
of performing a binding interaction with a target molecule, the chemical 

5 compound comprising an oligonucleotide (b) or functional analogue thereof 

that comprises a coding sequence (b2) coding for the identification of the 
chemical moiety (p), characterized in that the chemical compound further 
comprises at least one self-assembly moiety (m) capable of performing a 
combination reaction and of building a stable combination reaction product 

10 with a self-assembly moiety (m') of at least one similar chemical compound 

with a chemical moiety (q), being potentially capable of performing a bind- 
ing interaction with this target molecule. 

2. The chemical compound of claim 1, characterized in that it is capable of 
15 building a stable combination reaction product with a similar chemical com- 
pound that further comprises an oligonucleotide (b') or functional analogue 
thereof that comprises a coding sequence (b2') coding for the identification 
of the chemical moiety (q). 



20 3. The chemical compound of one of claims 1 or 2, characterized in that the 

self-assembly moiety (m) is a self-assembly sequence (bl) of the oligonu- 
cleotide (b), a functional analogue thereof, a ligand (1) capable to perform a 
complex reaction with a specific ion (i), or a peptide capable of association 
with other molecules. 

25 

4. The chemical compound of one of claims 1 to 3, characterized in that it 

comprises at least one chemical group that is capable of forming a covalent 
bond with at least one respective chemical group of a similar chemical com- 
pound with which had been formed a stable combination reaction product. 

30 

5. The chemical compound of one of claims 1 to 4, characterized in that the 

oligonucleotide (b) or functional analogue thereof is covalently and directly 
linked to chemical moiety (p). 
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6. The chemical compound of one of claims 1 to 5, characterized in that the 

oligonucleotide (b) or functional analogue thereof further comprises a link- 
ing portion (b3) which is situated between the self-assembly sequence (bl) 
and the chemical moiety (p). 

5 

7. The chemical compound of one of claims 1 to 4 or 6, characterized in that 

the coding sequence (b2) of oligonucleotide (b) or the functional analogue 
thereof is situated between the chemical moiety (p) and the self-assembly 
sequence (bl), 

10 

8. The chemical compound of one of claims 1 to 7, characterized in that the 

at least one self-assembly sequence (bl) is capable of performing a combi- 
nation reaction with the self-assembly sequences (bl 1 ) of two or more com- 
plementary oligonucleotides or functional analogues bound to other cherni- 
15 cal compounds comprising a chemical moiety (q,r,s —) in order to form oli- 

gomers like e.g. trimers or tetramers. 



9. A library of chemical compounds that comprise a chemical moiety (p) poten- 
tially capable of performing a binding interaction with a target molecule, the 

20 chemical compound comprising an oligonucleotide (b) or functional analogue 

thereof that comprises a coding sequence (b2) coding for the identification 
of the chemical moiety (p), characterized in that the chemical compounds 
further comprise at least one self-assembly moiety (m) capable of perform- 
ing a combination reaction and of building a stable combination reaction 

25 product with a self-assembly moiety (m') of at least one similar chemical 

compound with a chemical moiety (q), being potentially capable of perform- 
ing a binding interaction with this target molecule. 



10. The library of claim 9, characterized in that the chemical compounds are 
30 capable of building a stable combination reaction product with a similar 

chemical compound that further comprises an oligonucleotide (b') or func- 
tional analogue thereof that comprises a coding sequence (b2') coding for 
the identification of the chemical moiety (q). 
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11. The library of one of claims 9 or 10, characterized in that the self- 
assembly moiety (m) is a self-assembly sequence (bl) of the oligonucleo- 
tide (b) ; a functional analogue thereof, or a llgand (I) capable to perform a 
complex reaction with a specific ion (I), or a peptide capable of association 

5 with other molecules. 

12. The library according to one of claims 9 to 11, characterized in that it 

comprises chemical compounds according to any one of claims 3 to 8. 



10 13, The library according to any of claims 9, 11 or 12, characterized in that 

its individual combinations of moieties (p,q,r,s is derived by forming oli- 
gomers like e.g, dimers, trimers, ortetramers, i.e. by heterooligomerization 
of the self-assembly sequences (bl f bl v ) of the oligonucleotide (b,b') form- 
ing heteroduplexes, heterotriplexes or heteroquadruplexes. 

15 

14. The library according to claim 9, characterized in that its individual com- 
binations of moieties (p,q,r,s „.) is derived by chelation of the self-assembly 
moieties (nr^m 1 ) with specific ions (i). 



20 15. The library according to claim 13, characterized in that it comprises indi- 
vidually encoded sub-libraries (A) and (B), whereas sub-library (A) com- 
prises n compounds coupled to the 3' extremity of n different DNA oligonu- 
cleotides (b) and sub-library (B) comprises m compounds coupled to the 5' 
extremity of m different DNA oligonucleotides (b'). 

25 

16. The library according to claim 15, characterized in that in sub-library (A) 
or in sub-library (B) respectively, iadoacetamido- or maleido-derivatives of 
n or m chemical entities have been coupled to individual DNA oligonucleo- 
tides which carry a thiol group at the 3 1 or 5' end. 

30 

17. The library according to claim 15, characterized in that in sub-library (A) 
or in sub-library (B) respectively, amide derivatives - forming chemical 
structures such as ~0-P(0)2-0-(CH 2 )n-NH-CO-R, where R may correspond 
to a number of different chemical entitles, and n may range between 1 and 
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10 - have been coupled to the oligonucleotides carrying a phosphodiester 
bond at one extremity. 

18. The library according to one of claims IS to 17, characterized in that in 
sub-library (A) the self-assembly sequence (bl) is Interrupted by a d-spacer 
in opposite position to a code (B), the d-spacer preventing any undesired 
pairing to the bases of code (B) which encodes sub-library (B), whereas the 
oligonucleotide (b) of sub-library (A) has its distinctive code (A) towards the 
5' extremity. 

19. A method of biopanning ligands specific for target molecules, wherein a 
chemical compound comprising a chemical moiety (p) potentially capable of 
performing a binding interaction with a target molecule, the chemical com- 
pound comprising an oligonucleotide (b) or functional analogue thereof that 
comprises a coding sequence (b2) coding for the identification of the chemi- 
cal moiety (p), characterized in that the oligonucleotide (b) or functional 
analogue comprises at least one seff-assembiy sequence (bl) capable of 
performing a combination reaction and of building a stable combination re- 
action product with at least one self-assembly sequence (bl') of at least one 
complementary oligonucleotide or functional analogue of another chemical 
compound with a chemical moiety (q), being potentially capable of perform- 
ing a binding interaction with this target molecule is incubated with a target 
molecule and the resulting complex of the target and the stable combination 
reaction product of chemical compounds is physically separated from chemi- 
cal compounds which have not bound to the target. 

20. The method of claim 19, characterized in that a chemical compound ac- 
cording to at (east one of claims 1 to 8 is used for biopanning. 

21. The method of claim 19 or 20, characterized in that a library of chemical 
compounds according to at least one of claims 9 to 18 is used for biopan- 
ning. 
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22. A method to identify a target molecule with a chemical compound compris- 
ing a chemical moiety (p) capable of performing a binding interaction with 
this target molecule and further comprising an oligonucleotide (b) or func- 
tional analogue thereof characterized in that the chemical compound is 
5 bound to a target by biopanning according to at least one of claims 19 to 

21, 



23. The method of claim 22, characterized in that PCR-fragments are gener- 
ated by polymerase chain reaction (PCR), each of which carries the code of 
10 pairs of sub-library members (A) and (B), whereas sub-library (A) com- 

prises n compounds coupled to the 3* extremity of n different DNA oligonu- 
cleotides (b) and sub-library (B) comprises m compounds coupled to the 5' 
extremity of m different DNA oligonucleotides (b l ), 

15 24. The method of claim 23, characterized in that in sub-library (A) or in sub- 
library (B) respectively, iodoacetamido- or maleido-derivatives of n or m 
chemical entities are coupled to individual DNA oligonucleotides which carry 
a thiol group at the 3' or S' end. 

20 25. The method of claim 24, characterized in that in sub-library (A) the self- 
assembly sequence (bl) is Interrupted by a d-spacer in opposite position to 
a code (B), the d-spacer preventing any undesired pairing to the bases of 
code (B) which encodes sub-library (B), whereas the oligonucleotide (b) of 
sub-library (A) has its distinctive code (A) towards the 5' extremity. 

25 

26. The method of at least one of claims 22 to 25, characterized in that the 
length of the PCR-fragments are checked and their sequence identity is es- 
tablished by digesting the PCR-fragments with a restriction site for a specific 
endopeptidase (e.g. EcoKL), followed by cloning Into a suitable plasmid and 

30 sequencing- 

27, The method of at least one of claims 22 to 26 where several specific binding 
members are isolated at the end of a biopanning experiment, character- 
ized in that concatenamers are created, starting from the various PCR- 
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fragments present in the reaction mixture, the concatenated sequences are 
"read" by sequencing, revealing both the identity and the frequency of pairs 
of code (A) and code (B), 

5 28. The method of claim 23 where several specific binding members are isolated 
at the end of a biopanning experiment and sub-libraries (A) and/or (B) carry 
chemical moieties at the extremities of partially-annealing oligonucleotides 
characterized in that unpaired DNA strands are hybridized with target oli- 
gonucleotides (e.g. DNA oligonucleotides) being immobilized on one or more 
10 chips. 

29. The method of claim 28, characterized in that by using chip (A) or chip 
(B) respectively, the reading of the identity and/or frequency of members of 
sub-library (A) or sub-library (B) respectively, rescued after a biopanning 
15 experiment, is carried out and by decoding on chip (A) and (B) candidate 

components of sub-libraries (A) and (B), to be re-annealed and screened in 
a successive round of bio-panning are suggested. 

The method of claim 29, characterized in that increasingly stringent bind- 
ing to the target is mirrored by a reduction in the number of (A) and/or (B) 
members as identified on the respective chip and the possible combinations 
of the candidate (A) and (B) members are assembled individually or in 
smaller pools and assayed for binding to the target. 

25 31. The method of at least one of the claims 28 to 30, characterized in that 

libraries are allowed to self-assemble In order to form trimeric or tetrameric 
complexes (e.g. using DNA triplexes or quadruplexes for the oligomerization 
of compounds) by using three or four chips, respectively, which carry dis- 
tinctive target oligonucleotides for decoding. 

30 

32. The method of at least one of the claims 28 to 31, characterized in that 

the DNA of selected binding moieties is PCR amplified prior to chip hybridh 
zation. 
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